Automated Assignment of Backbone NMR Data using Artificial Intelligence
نویسندگان
چکیده
Nuclear magnetic resonance (NMR) spectroscopy is a powerful method for the investigation of three-dimensional structures of biological molecules such as proteins. Determining a protein structure is essential for understanding its function and alterations in function which lead to disease. One of the major challenges of the post-genomic era is to obtain structural and functional information on the many unknown proteins encoded by thousands of newly identified genes. The goal of this research is to design an algorithm capable of automating the analysis of backbone protein NMR data by implementing AI strategies such as greedy and A* search. ar X iv :1 50 6. 05 84 6v 1 [ cs .A I] 1 8 Ju n 20 15 1 Nuclear Magnetic Resonance (NMR) Nuclear magnetic resonance is a phenomenon in which atomic nuclei absorb electromagnetic radiation at frequencies related to their chemical properties and the local molecular environment. Biophysicists use this property to gain structural knowledge of biomolecules, including proteins, DNA and RNA. NMR spectroscopy is currently the only method that allows the determination of atomic-level structures of large biomolecules in aqueous solutions similar to their in vivo physiological environments. Several types of NMR experiments can be used in the analysis of protein structures. In particular, essential information is provided by the chemical shifts of NMR-active nuclei present in proteins, including hydrogen and isotopes of carbon and nitrogen. The chemical shift is a quantifier for the deviation in the resonant frequency of a nucleus from its value in a structure-free environment, and therefore provides information on the local conformation. Determining the chemical shifts of all or most of the nuclei in a biomolecule is the first step in determining its structure. 1.1 NMR Assignment Methodology An important set of chemical shifts in a protein are those corresponding to the backbone nuclei, including the nitrogen, attached hydrogen, and the alpha and beta carbon atoms (Cα and Cβ) of each of the residues. The backbone residues constitute the building blocks of a protein chain (Figure 1). These checmical shift signals are measured using various NMR experiments, and then matched to the individual residues in the protein in a process called sequential assignment. Figure 1: HNCACB NMR experiment. A prerequisite of the assignment process is data collection using experiments that can provide information about the connectivities between neighboring residues [1]. One such experiment, called HNCACB, yields signals corresponding to the Cα and Cβ nuclei of one residue in the protein (residue i) , plus the Cα and Cβ signals of the immediately preceding residue (residue i− 1) (Figure 1). A second experiment, CBCA(CO) NH, can be used to yield the chemical shifts of the preceding residue only. These experiment are not independent, so they allow scientists to distinguish unambiguously between signals from residue i and residue i− 1. Analysis of all inter-residue connectivities allows the linking of signals from each backbone atom with signals from their preceding neighbor, creating a pattern of sequentially linked chemical shift values which reflects the sequential linear arrangement of the
منابع مشابه
Automated backbone assignment of labeled proteins using the threshold accepting algorithm.
The sequential assignment of backbone resonances is the first step in the structure determination of proteins by heteronuclear NMR. For larger proteins, an assignment strategy based on proton side-chain information is no longer suitable for the use in an automated procedure. Our program PASTA (Protein ASsignment by Threshold Accepting) is therefore designed to partially or fully automate the se...
متن کاملTowards Automated Structure-Based NMR Resonance Assignment
We propose a general framework for solving the structurebased NMR backbone resonance assignment problem. The core is a novel 0-1 integer programming model that can start from a complete or partial assignment, generate multiple assignments, and model not only the assignment of spins to residues, but also pairwise dependencies consisting of pairs of spins to pairs of residues. It is still a chall...
متن کاملFast high-resolution protein structure determination by using unassigned NMR data.
NMR spectroscopy provides high-resolution structural information of biomolecules in near-physiological conditions. Although significant improvements were achieved in NMR spectroscopy in the last 20 years, the increase in genome sequencing data has created a need for rapid and efficient methods of NMR-based structure determination. 3] NMR data acquisition can be accelerated significantly when se...
متن کاملTowards Fully Automated Structure-Based NMR Resonance Assignment of 15N-Labeled Proteins From Automatically Picked Peaks
In NMR resonance assignment, an indispensable step in NMR protein studies, manually processed peaks from both N-labeled and C-labeled spectra are typically used as inputs. However, the use of homologous structures can allow one to use only N-labeled NMR data and avoid the added expense of using C-labeled data. We propose a novel integer programming framework for structure-based backbone resonan...
متن کاملRobust structure-based resonance assignment for functional protein studies by NMR
High-throughput functional protein NMR studies, like protein interactions or dynamics, require an automated approach for the assignment of the protein backbone. With the availability of a growing number of protein 3D structures, a new class of automated approaches, called structure-based assignment, has been developed quite recently. Structure-based approaches use primarily NMR input data that ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1506.05846 شماره
صفحات -
تاریخ انتشار 2015